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(54) Method and apparatus for generating a platform-specific compiler 



(57) Syslenns and methods for building a platform 
specific compiler having an embedded inslructjon se- 
lector in a multi-platform environment are provided. A 
set of user defined platform dependent cc^pi^er archi- 
tecture descriptors that describe corresponding archi- 
tectiira! features and a set of instruction predicates de- 



fining those insTructions to be selected are converted to 
platform dependent con^ptler object code and instruc- 
tion selector object code, respectively The platform 
specific compiler having the embedded instruction se- 
fector is formed from the platform dependent compiler 
object code, the instruction selector object code, and the 
platform independent compiler object code. 
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Description 

BACKGROUND OF THE INVENTION 

1. Field of Invention 

[0001] The invention reiates generalfy to computer 
systems. More parlicuiariy, melhods and apparatus lot 
selecting executable instructions in a multi-platfomn 
computing environment are disclosed. 

2. Description of Relevant Art 

[0002] The continuing proliferation of software plat- 
forms and hardware architectures ensures that both 
computer users and computer program developers will 
encounter many different computing environments in 
the course of their careers It should be noted that in the 
context of this discussion, the teim environment refers 
to the complete range of elements in a computing sys- 
tem that interact with the ported software. These ele- 
ments typicalfy include a processor and operating sys- 
tem as well as I/O devices, libraries, netv^orks, or in 
some cases, a larger human or physical system. Even 
though a few quasi-standard platforms (e.g. IBM-PC, 
UNIX) have become widely used there, as yet, is no uni- 
versal computing environment. In order to maintain and 
expand their viability, therefore, most software pro- 
grams will eventually face the need to be ported, such 
that an executable version of tho software program 
based on the existing version is created in the new com- 
puting envirc^iment. Portability^ or the ability of a sofi- 
ware program to be ported to a given environment (i.e. , 
the target) is, therefore, becoming universally recog- 
nized as a desirable attribute for most software pro- 
grams. It is clear, therefore, that porlabilily between dif- 
ferent computing platforms enhances the value of a soft- 
ware program both by extendmg Its useful lifecycle and 
by expanding the range of installations in which it can 
be readily used. As is well known in the art, a software 
program can include an application program, a system 
program, or a component of a program whereas a soft- 
ware system is a collection of software programs. 
[0003] A software program is portable if and to the de- 
gree that the cost of porting is less than the cost of re- 
writing the program in the new target environment. A 
software program would be perfectly portable if It could 
be ported at zero cost and, of course, this is never pos- 
sible in practice. In practice there are two basic porta- 
bility protocols, the first being binary portability (i.e., 
porting the executable form of the software program) 
and the second being source portability (i.e., porting the 
source language representation of the software pro- 
gram). Although binary portability protocols typically of- 
fer several advantages (related primarily to ease of port- 
ing) H can only be used to port software programs across 
strc3igly similar environments thereby severely limiting 
its usefulness, in contrast, since source portability pro* 



locols assume availabiNty of a source code, they typi- 
cally provide a greater ability to adapt a particular soft- 
ware program to a wider range of computing environ- 
ments. 

^ [0CK)4J Unlortunately, most of the porting process is 
still done by ad hoc methods that result in inefficient 
techniques thai add substantially to the costs of porting 
software from one platform io another. By way of exam- 
ple, a compiler translates a computer program from one 

10 language into another catching any errors in syntax 
along the way. Most commonly, a compiler translates 
some high level language, such as C++ or COBOU into 
machine language such that the computer can under- 
stand without any translation. In order to fully portacom- 
piler, therefore, several tasks must be accomplished in 
order for the ported compiler to be able to successfully 
and in a highly reliable manner, perform its designed 
functions while operating in a totally different platform 
than the one it was originally conceived. 

20 [0005] The several tasks required to be accomplished 
in order to fully port a compiler include proper instruction 
selection since generally many different instruction 
types can match the same machine independent se- 
mantics. A simple example is the operation defined as 

25 adding a constant of "1" to a value where the value of 
"1 " can be represented as either an S bit or a 32 bit pre- 
cision Integer number. \n another example, for an X86 
processor found in the Pentium® and Pentium lid) line 
of microprocessors, the floating point unit (FPU) has 3 

30 precision modes in which it can perform various opera- 
tions, such as addition and subtraction. In the case 
where the senn^tlcs require rounding to, for example 
24 bit precision, and an FPU contrc^ word has set the 
FPU precision to be, for example 53 bits, It would be 

3S inefficient and incorrect for the XS6 compiler to select 
instructions defined in the architecture description that 
produce 53 bits of precision without introducing addi- 
ticHiai rounding. 

[0006] Other examples include multiple hardware 

^0 platforms, such as the SPARC microprocessor config- 
ured as a VB or a V9 processor When configured as a 
VQ system, only V6 type instructions can be executed, 
however, when configured as the V9 system, either V8 
type or V9type instructions can be executed. Therefore, 

45 tt is essential that in those cases where the V8 system 
is operating that only V8 type instructions be selected 
since V9 instructions can not execute on the V8 system. 
[0007] It is also desirable to select not only those in- 
structions that will properly execute on a particular plat- 

^0 form, but also select those instructions that improve the 
overall perfornnance of the processor by reducing the 
"cost" of execution. By way of example, storing the re* 
suits of a particular operation, such as a subtraction, in 
a memory location is generally more computer resource 

ss Intensive (i.e. , more costly) than storing the same result 
In a data register Therefore, it would be more cost ef- 
fective, where possible: to select the instructlc^ whose 
cost is the least of all those instructions that could pos- 
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sibly be used. Using the example above, it would make 
sense Irom a cost effectiveness standpoint to select the 
instruction that stores its result in a register as opposed 
to those instructions that store their respective result in 
a location in memory. 

[0008] Therefore, what fs desired Is the capabiHty of 
defining a. selection protocol whereby not only are the 
proper inslructions selected to execute in a mulU-plal- 
form computing environment, but the cost of executing 
these selected instructions is minimized. 

SUMMARY OF THE INVENTION 

[0009] Broadly speaking, the invention relates to an 
improved method, apparatus and computer system lor 
building a compiler having an instruction selector in a 
multi-platform environment The invention can be imple- 
mented in numerous ways, including as a method, a 
computer system, and an apparatus. Several embodi- 
ments of the Invention are discussed below. 
[0010] According to one aspect of the present inven- 
tion, an apparatus lor compiling a platform specific com- 
piler having an embedded instruction selector is de- 
scribed. The apparatus includes a set ot user defined 
platform dependent compiler architecture descrfptors 
that describe corresponding architectural features of a 
particular hardware platform and instruction predicates 
used by the instruction selector. An architecture descrip- 
tor compiler converts the user defined platform depend- 
ent compiler architecture descriptors into the platform 
dependent compiler source code and instruction selec- 
tor source which is converted into platform dependent 
object code and instruction selector object code by a 
host compiler. The compiler is formed from platform in- 
dependent compiler object code and the platform de- 
pendent compiler object code in combination with the 
instruction selector object code. 

[0011] As a method for building a platform specific 
compiler, a set erf user defined platform dependent com- 
piler architecture descriptors that describe correspond- 
ing architectural features of a particular hardware plat- 
form dependent compiler and instruction predicates are 
provided. The descriptors and instruction predicates are 
converted into platform dependent compiler source 
code and instruction selector source code by an archi- 
tecture descfiptor compiler. The platform dependent 
compiler source code and instruction selector source 
code is complied into piatfomn dependent object code 
and instruction selector object code. The platform spe- 
cific conpiier having the embedded instruction selector 
is formed from the platform dependent object code, the 
instruction selector object code, and platform independ- 
ent compiler object code. 

[0012] In another embodiment, a platform specific 
compiler is disclosed. The compiler includes platform 
dependent compiier object code having embedded in- 
struction selector object code and platform Independent 
compiler object code which are suitable for executitMi 



on a particular hardware platform. An interface that is 
partially embedded in the platform independent code 
and partially embedded in the platform dependent ob- 
ject code mediates flow of information between the plat- 
5 form independent compiler code and the platform de- 
pendent compiler code during platform specific compiler 
run time. 

[001 3] These and other advantages of the present in- 
vention will become apparent upon reading the following 
10 detailed descriptions and studying the various figures of 
the drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

15 [0014] The invention, together with further advantag- 
es thereof, may best be understood by reference to the 
following description taken in conjunction with the ac- 
companying drawings in which: 

Fig. 1 is a representative block diagram of a multi- 
piatfornn compiler system In accordance with an em- 
bodiment of the invention; 

Fig. 2 illustrates a particular implementation of the 
multi-platform compiler shown in Fig 1 ; 
Fig 3 shows another implementation of the compil- 
er shown in Fig. 2; 

Fig. 4 shows a flowchart detailing a ccHTipiler build- 
ing process in accordance with an embodiment of 
the invention; 

30 Fig. 5 illustrates a Java Virtual Machine (JVM) heav- 
ing a platform specific compiler in accordance with 
an embodiment of the invention; 
Fig. 6A illustrates an exemplary AD file organization 
in accordance with an embodiment of the invention; 

35 Fig. 6B illustrates a particular relationship between 
various data fields included in the AD file shown in 
Fig. 6A; 

Fig. 7 illustrates an exemplary interface coupling 
platfomi dependent source code and piatform inde- 
pendent source code in accord^ce with an embod- 
iment of tho invention; 

Fig. 8 is an exemplary representation of a run-time 
process by the compilation engine in accordance 
with an embodiment of the invention; 
4S Fig 9 is a representation of a machine independent 
instruction; 

Fig. 10 is flowchart detailing the instruction selec- 
tion process in accordance with an embodiment of 
the invention; and 
^0 Fig. 11 illustrates a computer system employed to 
implement the invention. 

DETAILED DESCRIPTION OF THE EMBODIMENTS 

55 [001 5| in the following description, frameworks and 
methods of selecting instructions in a multi-platform 
computing environment are described. The inventlOT 
will initially be described in terms of a multi-platform 
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compiler residing in a Java virtual nnaciifne. In general, 
In order to build a platform specific compjler havir^g an 
embedded instruction selection code, a set of user de- 
fined platform specific archrtecture descriptors in the 
fornn of an architecture description language ( AD L) file 
and a set of user defined instruction predicates are pro- 
vided. It should be noted that the ADL can take many 
forms well known lo those skilled the arl such as C-n-, 
Attribute Grammars, Custom Description Languages, 
etc., or some combinaticn of these forms. In the de- 
scribed embodiment, the instruction predicates take the 
form of boolean expressions that in conjunction w^th in- 
St ruction selection code embedded m the platform spe- 
cific compiler, enable only those instructions that are to 
be executed. In one embodiment of the invention » the 
determination of the boolean expression's value may 
only be possible during instruction selection. 
[0016] The AD file includes the user defined instruc- 
tion predicates that form an input to a multii^latform 
compiler builder that includes an architecture design 
language compiler (ADLC). In one embodiment, the 
ADLC is used to generate particular target platform de- 
pendent compiler source code used to build the target 
specific compiler. The ADLC also generates target plat- 
form instruction seiection source code and target plat- 
form instruction predicate source code. The platform 
specific compiler having an embedded instruction se- 
lector is built using the platform dependent compiler 
source code, the instruction selection source code, and 
the instruction predicate source code provided by the 
ADLC in combination with the platform independent 
compiler object code otherwise provided. 
[0017] In this way, the platform specific compiler se- 
lects not only those instructions deemed by the user to 
be suitable for execution on the target plaifoim but those 
instructions which improve processor perlormance by 
reducing instryction execution costs. In this way, the re- 
liability and performance of the processor in a multhplat- 
form environment are substantially improved 
[0018] Fig. 1 IS a representative block diagram of a 
multi-platform compiler system 100 in accordance with 
an embodiment of the invention. The multi-platform 
compiler system 100 is capable of building, or compiling, 
a platform specific compiler capable of selecting instruc- 
tions appropriate to the target platform In addition, the 
compiler is also capable of selecting those instructions 
that improve overall processor performarK^e by reducing 
instruction execution cost. 

[0019] In the described embodiment the platform 

specific compiler is built by compiling platform inde- 
pendent source code representing those portions of the 
platform specific compiler that are independent of any 
particular platform and platform dependent source code 
representing those features of the compiler that are plat- 
form specific, in those cases v/here user defined (ex- 
plicit) instruction predicates are provided, or where it is 
detemr^ined that particular instructions require instruc- 
tion predicates regardless whether or not they are user 
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supplied (impiicll), the compiler Is built using instruction 
predicate source code that provides the platform specif- 
ic compiler with the aJorementioned selection capability. 
[0020] More partk^ularly, in the described embodi- 

5 ment, the multi -platform compiler system 100 includes 
a compiler builder 1 02 arranged to build the platform de- 
pendent compiler using platform specrfic architecture 
descriplors and inslrucUon predicaLes. In the described 
embodiment, the platform specific architecture descrip- 

10 tors take the form of architecture description language 
(ADL) used to represent those platform dependent por- 
tions of the target platform compiler while the instruction 
predicates take the form of boolean expressions. It 
should be noted that the ADL can take many forms well 
known to ^ose skilled in the art such as C++, Java 
source code, Pascal, etc. Typically, it is the AD file that 
is coded by the user of the multi-platform compiler sys- 
tem 1 GO, however, in some cases, the AD file is provided 
by original equipment manufacturers in situations re- 

20 f erred to as *'turn key'* systems. In these situations, the 
end user has selected which target platforms are 
deemed to be useful and the supplier has provided the 
necessary coding efforts. 

[0021] In many instances, the AD file is located in, lor 

2S example, memory systems external to the compiler 
builder 102 such as an AD file 104 and an AD file 106. 
It should be noted that although any number of AD files 
can be provided, each specific to a particular platform, 
however, only one AD file at a time is processed by the 

^0 compiler builder 1 02 In this way when platform depend- 
ent source code specific to, for example, a platform type 
1 is provided, the customized compiJer builder 1 02 is ca- 
pable of compiling source code into platform type 1 ob- 
ject code. For example, the AD file 1 04 can include ADL 

35 code having instruction predicates used by the compHer 
builder 102 to build an X86 compiler capable ol preler- 
entlally selecting those instructions enabled by the user 
defined Instruction predicates. As well known in the art. 
X86 object code are those instructions executable by an 

■^0 X86 microprocessor manufactured by the Intel Corpo- 
ration of Santa Clara, CA. 

[0022] Along the same lines, the AD file 106 can in- 
clude ADL code having instruction predicates used by 
the compiler builder 102 to build, for example, a SPARC 
■^5 compiler capable of preferentially selecting those in- 
structions enabled by the user defined instruction pred- 
icates. 

[(K)23] In both cases, the overall performance of the 
processor can be improved since a cost analysis is also 

^0 performed substantially simultaneously with the selec- 
tion. In this way, the compiler builder 102 is capable of 
aulonrvatically providing as many of the platform specific 
compilers having appropriate instrudion selection code 
as there are AD files and corresponding user defined 

5^ instruction predicates as are available. 

[0024] Referring now to Fig. 2 illustrating a particular 
implementation of the multi*platform compiler builder 
1 02 shown in Fig 1 . In the embodiment shown, the com- 
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pMer buildef 102 includes an architecture description 
language compiler 202 (ADLC) coupled to an AD file 
204 having user defined instruction predicates file 201 
The ADLC 202 is arranged to connpiie the platlonn spe- 
cific ADL code included in the AD file 204 and the in- 
struction predicates included in the file 201 to platform 
dependent compiler source code, instruction seleclor 
source code, and inslruclion predicate source code, re- 
spectively. In addition^ the ADLC 202 is capable of gen- 
erating instruction predicates that are not explicitly writ- 
ten in the AD file 204 in order to ensure a correct or el- 
ficient instruction selection, heretofore referred to as im- 
plicit instruction predicates. 

[002S] The platform dependent ctxnpiler source code, 
the instruction selector source code, and the instruction 
predicate source code are, in turn, provided to a host 
compiler 203 coupled to the ADLC 202. In the described 
embodimenL the host connpiier 203 is a C++ connpiier 
well known to those skilled in the art. However, any com- 
piler suitably arranged to compile source code to target 
specific compiler object code can be used. 
[0026] During compile time, the host compiler 203 
coupled to the ADLC 202 compiles the platform depend- 
ent source code in combination with the platfomn de- 
pendent instruction predicates source code and instruc- 
tion selector source code to form the platform depend- 
ent compiler objecl code having embedded instruction 
selector object code. In the described embodiment, the 
platform dependent compiler object code is represented 
by a code block 206 that includes the embedded instruc- 
tion selector objecl code represented by a code block 
216 included in a compiler unit 208 coupled to the host 
compiler 203. The embedded instruction selector object 
code provides the customized compiler unit 208 with the 
capability of selecting those instructions deemed appro- 
priate by the user. In addition, by performing a cost anal- 
ysis, those instructions which reduce instruction execu- 
tion cost are preferentially selected thereby improving 
processor performance by reducing overall instruction 
execution cost. 

[0027] In some cases, the platform independent com- 
piler object code is derived Irom platform independent 
compiler source code compiled by the host compiler 
203. In other cases, such as the described embodiment, 
the piatform independent compiler object code Is al- 
ready provided In the described embodiment, the plat- 
form independent object code is represented by a code 
block 210 that includes, in one embodiment, a compila- 
tion engine 212 coupled to a platform independent in- 
terface 214. In a particular implementation, during what 
is referred to as run-time, the interface 21 4 mediates the 
transfer o1 intormatlon between selected portions of the 
platform dependent object code and instruction selec- 
tion object code to the compilation engine 212 
[002S] By way of example, when an X86 microproc- 
essor is required, the user provides, lor this example, 
the AD file 204 having stored therein ttie appropriate 
ADL code specific to the X86 platfomi and the file 201 



having the appropriate instruction predicates. As direct- 
ed by the compiler builder 102. both the X86 specific 
ADL code and the instruction predicates are supplied to 
the ADLC 202. The ADLC 202 then compiles the ADL 

5 code to X86 specific compiler source code and the in- 
struction predicates to corresponding platform specific 
instruction predicates source code and instruction se- 
lector source code. It should be noied that the ADLC 
202 is a universal compiler capable of converting any 

10 properly constructed AD file into corresponding platform 
specific compiler source code and, if required, platform 
dependent instruction predicate source code. In this 
way, any requirements that the user code, or in any way 
modify the ADLC 202 or any portion thereof are sub- 

^5 stantially eliminated, Therefore: the only coding re- 
quired of the user is that required to provide a properly 
constructed and verified AD file that can include, if nec- 
essary, instruction predicates. 

[0029] Once the ADLC 202 has compiled the XS6 

^0 specific ADL code to X86 compiler source code and the 
instruction predicates into instruction selector source 
code: they are compiled to object code using host com- 
piler 203 and represented by the biocks 206 and 216, 
respectively. 

25 [0030] In some cases, multiple platform dependent 
compiler source code files are made available to a com- 
piler unit 300 shown in Fig. 3. The compiler unit 300 is 
one implementation of the compiler unit 208 shown in 
Fig. 2 and, as such, ^ould only be consider exemplary 

30 in nature. In the described embodiment, the various plat- 
form dependent AD f^ies are stored in a file stack 301. 
The tile stack 301 can be local to the compiler unit 300 
or, in some cases, can be rennotely located in. for exam- 
ple, data bases, remote servers, etc. This arrangement 

35 is particularly well suited for applications involving trans- 
ferring dala over coupled computer networks, such as 
the Internet, local area networks (LANs), and the like. 
This use of multiple AD files is particularly advantageous 
since it provides the compiler unit 300 with the capability 

40 of operating, as needed, as any platform specific com- 
piler rGprosentod by the corresponding AD file. By way 
of example, the file stack 301 includes platform an AD 
file 302 representing, for example, SPARC specific de- 
scriptors along with their corresponding instruction 

4S predicates which is converted to corresponding platform 
dependent source code 304 and instruction selector 
306. With this arrangement, any platform having its cor- 
responding AD file included in the file stack 301 could 
be selected to customize the compiler unit 1^0. Al- 

^0 though not shown, a seleclor is typically employed to 
select which AD file is to be input to the ADLC 202. 
[0031] As discussed above, the compiler builder 102 
builds a particular compiler having an instruction selec- 
tor, using in one embodiment, a process 400 detailed by 

55 the flowchart shov\/n in Fig. 4. The process 400 begins 
at 402 by providing platform specific architecture de- 
scriptors in the form of ADL code stored in, for example, 
an AD file. At 404 a determination is made whether or 
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not the AD file includes user defined instruction predi- 
cates (explicit) or whether or not implicit instruction pred- 
icates are required. Impiicit instruction predicates are re* 
quired when, for example, it is essential tor proper In* 
struction executionthat particular data be present in par* 
ticular locations. Such a situation occurs with those in- 
structions having multiple occurrences of a, particular 
operand where IL is essential that each operand must 
be the same otherwise a processing error is likely to oc- 
cur. Since the risks of error is Nkely to vary from platform 
to platform (i.e., executing a 3 address instruction on a 
2 address platform versus executing a 2 address in- 
struction on a 3 address platform), relying on the user 
to comprehend this fact is risky. In addition, an instruc* 
tion predicate can require the use of implementation de- 
tails of the compiler unit that may not be available to the 
writer of the AD file Therefore, implicit Instruction pred- 
icates are a part of the instruction selection process in 
that those particular instructions requiring special con- 
sideration are provided with the appropriate instruction 
predicate by the ADLC. 

[0032] When explicit and/or impiicit instruction predi- 
cates are required, the ADLC provides the instruction 
selector source code at 406. Substantially simultane- 
ously with 406= the ADLC provides platform specific 
compiler source code at 408 based upon the AD file in- 
puts, whereaS; platform independenl compiler source 
code is provided at 410. The host compiler then builds 
the platform specific compiler having instruction selec- 
tion and execution cost analysis capability by compifing 
the platfomn independent source code, the platform de- 
pendent compiler source code, and the instruction pred- 
icates source code concurrently at 412. 
[0033] Once the platform specific compiler has been 
built at 414, the platform specific compiler is available 
during run-time for selecting those cosi effeciive instruc- 
tions that are compiled into platform specific object code 
as needed. In a particular embodiment, during run-time, 
the platform independent interface mediates the flow of 
information between the compilation engine and the 
platform dependent compiler source code and instruc- 
tion predicate source code. 

[0034] More recently, the Java programming lan- 
guage, an object-oriented language, has introduced the 
possibility of compiling output (called bytecode) that can 
run on any computer system platform lor which a Java 
virtual machine (or bytecode interpreter) is provided. 
The Java virtual machine is ctesigned to convert the 
bytecode into instmctions that can be executed by the 
actual hardware processor. Using this virtual machine, 
rather than being interpreted one instruction at a time, 
bytecode can be recompiled at each particutar system 
platform by, in some cases, a Just-in-time (JIT) compiler. 
[0035] Fig. 5 illustrates an apparatus that includes a 
Java Virtual Machine (JVM) 500 incorporating the com- 
piler unit 208 in accordance with an embodiment of the 
invention. In the described arrangement, a platform spe- 
cific AD file stack 502 coupled to the ADLC 202 includes 
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a group of AD fiies each representing particular platform 
dependent compiler features. In the described embodi- 
ment, the AD file stack 502 includes the AD file 1 04 and 
AD tile 106 representing the X36 and SPARC architec- 

5 tures. respectively. In the case where a number of dif- 
ferent AD files are included in the AD file stack 502, a 
selector unit (not shown) is typically used to select a par- 
ticular AD file from the AD file slack 502 corresponding 
to the desired operating platform. When the af^ropriate 

10 AD file is selected, the ADLC 202 converts the ADL code 
included in the selected AD file into appropnate platform 
dependent compiler source code as discussed above. 
[0036] In the Java programming language and envi- 
ronment, a just-in-time (JiT) compiler is a program that 

^5 turns Java bytecode into instructions that can bG sent 
directly to the processor. After a Java program has been 
written, the Java soorce language statements are com- 
piled by the Java compiler into Java bytecode rather 
than into code that certains instructions that match a 

20 particular hardware platform's processor (for example, 
an Intel Pentium microprocessor or an IBM System/390 
processor). The Java bytecode is platform-independent 
code that can be sent to any platform and run on that 
platform. 

[0037] More particularly when bytecodes are provid- 
ed to a JIT compiler provided by the compiler unit 208, 
the compilation of methods contained in bytecodes 504 
is delayed until the methods are about to be executed. 
When bytecodes 504 are provided to an interpreter 506, 

30 bytecodes 504 are read into interpreter 506 one byte- 
code at a time. Interpreter 506 then performs the oper- 
ation defined by each bytecode as each bytecode is 
read into interpreter 506. That is, interpreter 506 "inter- 
prets" bytecodes 504, as will be appreciated by those 

35 Skilled in the art. in general, interpreter 505 processes 
bytecodes 504 and performs operations associated with 
bytecodes 504 substantiafly continuously. 
[0038] When a method is compiled, the compiler unit 
208 generates machine instructions as selected by an 

^0 instruction selector 508. The compiler unit 208 then gen- 
erates machine instructions from the selected byte- 
codes 504, and the resulting machine language instruc- 
tions may be executed directly by the target platform op- 
erating system 510, In general, the machine-language 

4S instructions are discarded when virtual machine 500 ter- 
minates. 

[0039] Referring now to Fig. 6 illustrating the organi- 
zation of an AD file 600 in accordance with an embodi- 
ment o1 the invention. It should be noted that the organ- 

^0 ization shown is one of the many possible organizations 
that AD file 104 can take. In the described embodiment, 
the AD file 600 is s hierarchically organized set of dis- 
tinct platform architecture descriptor data fields. By way 
of example, a register definition data field 602 is used 

55 by the ADLC to describe ndividual registers and classes 
of registers with the target architecture. An encoding 
block data field 604 specifies the encoding classes used 
by the target compiler to output byte streams, A frame 
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management block data field 606 includes information 
that defines the frame structure and management pro- 
tocols. Such information defines, for example, what di- 
rectionthe frame stack grows, the number of stack slots 
consumed byamonitor enter ope ration, stack alignment 
requirements, number of stack slots reserved for "Top 
of Stack", amongst others. 

[0040] An operand dala frefd 608 provides operand 
definitions that must precede instruction definitions for 
correct compilation in the ADLC since operands consti- 
tute user defined types vy^ich are used in instruction def- 
initions. A pipeline rules data field 610 is provided to de- 
fine the behavior of the target architecture pipeline. An 
^nstructicm definiticnis data field 61 2 provides instruction 
tormats for the target architecture as well as corre- 
sponding instruction predicates. A peephole data field 
614 provides target architecture specific optimization 
rules used by the ADLC. 

[0041] The hierarchical organization of the AD file 600 
underfies the interrelationship amongst the various AD 
input data fields. Fig. 6B graphically illustrates one such 
relationship, specifically, the relalionship between the 
various operands included in the instruction definitions 
data field 612. By performing an backwards traversal 
from using the instruction definitions data fiefd 612 as 
the root, the relational dependencies for the various op- 
erands required to be input to the AD input data field is 
determinable. For example, by performing an upward 
traversal starting from the instruction definition data field 
61 2 as the root and extending upward along the various 
branches, the pipefine rules data field 610 and operand 
definitions data field 608 are encountered. Performing 
an upv\/ard traversals from the from the pipeline rules 
data fiekl 612 is the register definllions data field 602, 
while the register definitions data field 602 and the en- 
coding class dala field 6D8 are encountered when an 
LI p wards t rave rsa I from the ope rand def in iticsis data f iekd 
BOB is performed. In this way, the various operands re- 
quired to fully define a particular instruction definition is 
provided. 

[0042] Fig. 7 illustrates an exemplary interface 700 
used by the platform dependent compifer in accordance 
with an embodiment of the invention. The interface 700 
mediates communication between the compiler engine 
212 (or the platform independent object code) and the 
platform dependent object code in the block 206 in the 
embodiment shown, the ADLC output includes object 
code for a deterministic finite automaton (DFA) 702 that 
specifies the mapping from ideal operations to machine 
instructions. The ADLC output also Includes object code 
that defines a set of instruction classes 704 that are used 
to, lor example, define legal register masks, encoding 
methods, branch olfset behavior, etc. Object code for a 
peephole rules oracle706 that specifies machine specif- 
ic trees that are legal to optimize and what the correct 
replacement is for those trees is also output by the 
ADLC. In this way, the plattorm specific architecture 
characteristics are automatically provided in a format 



suitable for the compilation engine 212 to use in com- 
piling source code into platform dependent object code 
[0043] The target independent portion of the interface 
700 is coupled to a matcher 70S which generates input 

5 trees to be processed by the DFA 702 that performs bot- 
tom up rewrite rule system (BURS) style tree pattern 
matching in orderto select machine instructions for ideal 
operations in the inlermediale represenlaUon, 
[0044] Ttie interface 700 also includes object code ar- 

10 range to act as a peephole DFA that perfomns tree pat- 
tern matching to find optimization candidates and re- 
places matched trees of machine instructions with opti- 
mized trees of machine instructions. A matcher 70S per- 
forms Instruction saiection using the matcher DFA and 

^= builds machine specific intermediate representations. A 
scheduler 71 0 orders the machine specific intermediate 
representations while a register allocator 712 selects a 
legal assignment of registers to operands in the ma- 
chine specific representation. This includes the insertion 

^0 of any instructions necessary to relocate values to prop- 
er locations (such as moving arguments totherr appro- 
priate location specified by the calling convention). A 
peephole optimizer 714 pattern matches small trees in 
the machine specific representation and replaces them 

25 with more optimal machine specific trees. An object 
code outpul 716 uses virtual calls to encode machine 
specific representation as machine object code in a buff- 
er and makes the buffer available to the virtual machine. 
[0045] Fig. 8 is an exemplary representation of an in- 

30 struction compilation process carried out during run- 
time by the compilation engine 212 in accordance with 
an embodiment of the invention. During the run-time, the 
compilation engine 212, when required, generates an 
information request (referred to, in a particular embodi- 

35 ment, as an emit function call) for specific information 
from the platform dependent compiler object code and, 
if necessary, the instruction selector object code. By way 
of example, when the cximpilation engine 212 requires 
platform specific information in order to select an in- 

40 Struction using the embedded instruction selector the 
compilation engine 2 1 2 executes a function call to a in- 
formation requestor 802. The information requestor 
802, in turn, is coupled to the interface 700 which directs 
the function cail to a information retriever 804 coupled 

45 to the DFA having the required instruction predicates In 
the described embodiment, the DFA then selects the in- 
struction based upon the corresponding instructron 
predicates. The information retriever 804, in turn, re- 
sponds to the function call by retrieving the requested 

50 information. It should be noted that the retrieved infor- 
mation is structured in such a manner as to be readily 
used by the compilation engine 212. 
[0046] Fig. 9 represents an exemplary machine inde- 
pendent graph 900 of an instruction having multiple re- 

5S currences o1 an operand in accordance with an embed- 
imentof the invention. The machine independent graph 
900 is mapped to those machine dependent instructions 
having the same semantics. For example, tfie graph 900 
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node 902 represents at the most basic levef a subtrac- 
tion operation that would maplo any number of machine 
dependent subtraction operation (subA, subB, subC) 
having the same semantrcs co-responding to subtrac- 
tion. However as discussed previously, not all of the 
possible mappings are desirable since some may in fact 
result in processing errors. This can occur, Tor example, 
in those siluations where Ihe large! platform is a 2 ad- 
dress processor (such as an X66) and the result of the 
subtraction operation is required to be stored into a lo- 
cation distinct from that of the inputs (as can be done by 
3 address instructions in Ihe SPARC architecture). Typ- 
ically, a bottom up matching process is used to break 
down the binary tree representation of the machine in- 
struction into semantics that the target platform can un- 
derstand and execute. In this example, each node of the 
tree is mapped to a target dependent instruction as se- 
lected by the instruction predicate using the instruction 
selection code generated by the ADLC. 
[0047] The root instruction node 902 indicates that the 
instruction 900 is an integer subtraction operation hav- 
ing as inputs the results of node 904 and node 906. The 
result of the subtraction operation is stored rn a register 
at 908. In this example, the instruction selector selects 
the most cost effective instruction or sequence of in- 
structions out of all matching instructions to be execut- 
ed. 

[0048] Fig. 10 is a flowchart detailing an instruction 
selection process 1000 in accordance with an embodi- 
ment of the invention. The process 1000 begins at 1002 
by locating a node in the machine independent binary 
tree representation of the instruction. At 1 004, the in- 
structi<xi selection code corresponding to the located 
node Is obtained and executed. The next potential user 
defined Instruction that can matches the located node 
is then checked at 1006. This matching is based upon 
matching comparing the semantics between the ma- 
chine independent representation of the node and the 
target platform representation of the node. At 1008, a 
determination is made whether or not the location of in- 
puts of the user defined instruction matches the location 
at inputs for the located node. If it is determined that 
there is match, then a determination is made at 1010 
whether or not the instruction predicate is satisfied. If 
the instruction does not match or the instruction predi- 
cate is not satisfied, then control is passed back to 1 006 
where the next pc^ential user defined instruction that 
matches the node is selected. 

[0049] Returning back to 1010. If the instructicMn pred- 
icate is satisfied, then an estimate of the execution cost 
is made for the user defined instruction at 1012, A de- 
terminaticxi is made at 1 014 whether or not the cost es- 
timate is less than the previous cost estimate. If the cost 
estimate is less, then the cost estimate variable is up- 
dated at 1016 and Ihe best instruction is updated at 1018 
after which a determination at 1020 is made whether or 
not there are more user defined matching instructions. 
If there are no matching instructions then the prccess 



stops, however if there are more matching instructions, 
then control is passed back to 1006. 
[0050] Returning to 1014, if it is determined that the 
cost estimate is not less, then it there are additional 
£ matching instructions, control is passed back to 1006, 
otherwise the process stops. 

[0051] Fig. 11 illustrates a computer system 1100 em- 
ployed to implement the invention. The computer sys- 
tem 1100 or more specifically, CPUs 1102, may be ar- 

10 ranged to support a virtual machine, as will be appreci- 
ated by those skilled in the art. As is well known in the 
art. ROM acts to transfer data and instructions unidirec- 
tionally to the CPUs 1102, while RAM is used typically 
to transfer data and instructions in a bi-dirsctional man- 

^5 ner CPUs 1102 may generally include any number ot 
processors. Both primary storage devices 1104, 1106 
may include any suitable computer-readable media A 
secondary storage medium 1106, which is typically a 
mass memory device, is also coupled bi-directionally to 

^0 CPUs 1 1 02 and provides additional data storage capac- 
ity. The mass memory device 1108 is a computer-read- 
able medium that may be used to store programs includ- 
ing computer code, data» and the like. Typically, mass 
memory device 1108 Is a storage medium such as a 

25 hard disk or a tape which generally slower than primary 
storage devices 1104, 1106. Mass memory storage de- 
vice 11 08 may take the form of a magnetic or paper tape 
reader or some other well-known device. It will be ap- 
preciated that the information retained within the mass 

30 memory device 1103, may, m appropriate cases, be in- 
corporated in standard fashion as part of RAM 1106 as 
virtual memory. A specific pnmary storage device 1104 
such as a CD-ROM may also pass data unidirectional !y 
to the CPUs 1102. 

35 [0052] CPUs 1102 are also coupled to one or more 
^nput/outpui devices 1110 that may include, but are not 
limited to, devices such as video monitors, track balls, 
mice, keyboards, nnicrophones, touch-sensitive dis- 
plays, transducer card readers, magnetic or paper tape 

40 readers, tabletS: atyfuses. voice or handwriting recog- 
nizers, or other well-known input devices such as, of 
course, other computers. Finally, CPUs 1102 optionally 
may be coupled to a computer or tGiecommunicattons 
network, e.^., an Internet network or an intranet net- 

^5 work, using a network connection as shown generally 
at 1112. With such a network connection, it is contem- 
plated that the CPUs 1102 might receive information 
from the network, or might output information to the net* 
work in the course of performing the above-described 

50 method steps. Such information, which Is often repre- 
sented as a sequence of instructions to be executed us- 
ing CPUs 1102, may be received from and oulputted to 
the network: for example, in the form of a computer data 
signal embodied in a carrier wave. The above-described 

55 devices and materials will be familiar to those of skill in 
the computer hardware and software arts. 
[0053] Although only a few embodiments ot the 
present invention have been described, it shouki be un- 
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derstood that the present invention may be embodied 
in many other specific forms without departing froiri the 
spirit Of the scope of the present invention, By way of 
example, the multi-platform compiler can be used in any 
computing system. 

[0054] Although the methods of porting a compiler 
from one operating system to another different operat- 
ing system in accordance with the present invention are 
particularly suitable for implementation with respect to 
a Java™ based environment, the methods may gener- 
ally be applied in any suitable object-based environ- 
ment In particular, the methods are suitable for use in 
platform-independent object-based environments, it 
should be appreciated that the methods may also be im- 
plemented in some distributed object-oriented systems. 
[0055] While the present invention has been de- 
scribed as being used with a distributed object based 
computer system, it should be appreciated that the 
present invention may generally be Implemented on any 
suitable computing system having a compiler. There- 
fore, the present examples are lo be considered as il- 
lustrative and not restrictive, and the invention is not to 
be limited to the details given herein, but may be modi- 
tied within the scope of the appended claims along with 
their fuli scope of equivafents. 



Claims 

1. An apparatus for generating a platform specific 
compiler having an embedded instruction selector, 
comprising: 

a set of user defined platform dependent com- 

piler architecture descriptors that describe cor^ 
responding architectural features ol a particular 
hardware platlorm dependent compiler; 
a set of instruction predicates used lo identify 
platlorm specific instructions selected by the in- 
struction selector; 

an architecture descriptor compiler arranged to 
convert the user defined platform dependent 
compiler architecture descriptors into the plat- 
fomi dependent compiler source cods and ar- 
ranged to convert the set of instruction predi- 
cates into platform specific instructor selector 
source cods; 

a host compiler arranged lo compile the plat- 
form dependent compiler source code into plat- 
form dependent compiler object code and ar- 
ranged to compile the platlorm specific instruc- 
tor selector source code into the embedded in- 
stmction selector object code; 
platform independent compiler object code; 
and 

an interface arranged to mediate the flow of in- 
formatbn between the platform dependent 
compiler object code and the platform inde- 



pendent compiler object code during run time 
for the platform specific compiler wherein dur- 
ing platform specific compiler run time, the em- 
bedded Instructbn selector selects the instruc- 

5 tion to be compiled based upon the execution 

cost of the selected instruction, and wherein the 
embedded instruction selector provides implicit 
instruction predicates used by the platform spe- 
cific compMer to compile the selected instruc- 

fo tion. 

2. An apparatus as recited in claim 1 , wherein the plat - 
form specific compiler includes platform independ- 
ent compiler object code and platform dependent 
compiler object code suitable for execution of the 
particular hardware platform. 

3. An apparatus as recited in claim 2, wherein during 
platform specific compiler run time, the platform in- 

20 dependent compiler object code requests specific 
platform dependent object code information by pro- 
viding an information request to the interface which 
directs the information request to a pre-determined 
platform dependent compiler object code informa- 

^5 tion retriever 

4. An apparatus as recited in claim 3. wherein the plat- 
form d^endent compiler object code information 
retriever responds to the information request by re- 

30 trieving specific platform dependent compiler object 
code rnlormation in satisfaction ol the information 
request 

5. An apparatus as recited in claim 4, wherein the re- 

35 trieved information is provided to the interface 
which, in turn^ directs the information to the infor- 
mation requestor 

6. An apparatus as recited in claim 1 , wherein said ap- 
^0 paratus comprises a plurality of sets of user defined 

platform dependent architecture descriptors, 
wherein each of which corresponds to a different 
hardware platform. 

-^o 7. A method of building a platform specific compiler 
having an embedded rnstryction selector, compris- 
ing: 

providing a set of user defined platform de- 
pendent compiler architecture descriptors that 
describe corresponding architectural features 
of a particular hardv^^re platfoim dependent 
compiler; 

providing a set of user defined instruction pred- 
55 icates used by the embedded instruction selec- 

tor to select those instructions lo be compiled 
by the platform specific canpiler during run 
time; 
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converting the set of user defined platform de- 
pendent compiler architecture descriptors into 
platform dependent compiler source code by 
an architecture descrrptor compiler; 
converting the set of user defined instruction 5 
predicates into platform instruction selector 
source code by an architecture descriptor com- 
piler; 

compiling the platform dependent compiler 
source code into platform dependent object to 
code by a host compiler coupled to the archi- 
tecture descriptor compiler: 
compiling the instruction selector source code 
into instruction selector object cods by a host 
compiler coupled to the architecture descriptor 
compiler; 

providing platform independent compiler object 
code, wherein the platform independent com- 
piler object code and the platform dependent 
compiler object code are suitable for execution 
on the parlicular hardware platform: and 
forming the embedded instruction selector from 
the platform dependent compiler object code, 
the instruction selector object code, and the 
platform independent ccMnpiler object code, 25 

8, A method as recited in claim 7, further comprismg: 

requesting specific platform dependent object 
code information by the platform independent 
compiler object code by the piatform independ- 
ent object code dunng platform specific compil- 
er run lime; 

providing an information request to the inter- 
face: and 3B 
directing the information request to a pre-deler- 
mined platform dep^dent compiler object 
code information retriever by the intertace. 

9, A method as recited in claim 8, further comprising: 

retrieving specific platform dependent compil- 
er object code information in satisfaction of the in- 
formation request in response to the request by the 
information retriever. 

45 

10, A method as recited in claim 9, further comprising: 

dir^ting the retrieved information to the infor- 
mation requestor by the interface, 

11, A method as recited in claim 10, further comprising: so 

directing the retrieved information to the infor- 
mation requestor by the interface. 

12, A platform specific compiler having an embedded 
instruction selector, comprising: 

a platform dependent compiler object code; 
a platform independent compiler object code. 



wherein the platform independent compiler ob- 
ject code and the platform dependent compiler 
object code are suitable for execution on a par- 
ticular hardware platform; 
a piatform dependent instruction selector ob- 
ject code embedded in the platform dependent 
compiler object code: 

an interface partially embedded in the platform 
independent code and partially embedded m 
the platform dependent object code, wherein 
during platform specific compiler run time, the 
interface mediates flow of information between 
the platform independent compiler code and 
the platform dependent compiler code. 

13. A compiler as recited in claim 12, wherein during 
platform specific compiler run time, the platform in- 
dependent compiler object code requests specific 
piatform dependent object cod© information by pro- 
viding an information request to the interface which 
directs the information request to a pre-date rmined 
platform dependent compiler object code informa- 
tion retrtever. 

14. An apparatus as recited in claim 13, wherein the 
piatform dependent compiler object code informa- 
tion retriever responds to the information request by 
retrieving specific platform dependent compiler ob- 
ject code information in satisfaction of the informa- 
tion request. 

15» An apparatus as recited in claim 14, wherein the re- 
trieved intormation Is provided to the interlace 
which, in lurn^ directs the information to the infor- 
mation requestor. 

16. An apparatus as recited in claim 12, wherein said 
apparatus comprises a plurality of sets of user de- 
fined platform dependent architecture descriptors, 
wherein each of which corresponds to a different 
hardware platform. 
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